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ABSTRACT 

In  this  paper  we  present  a  statistical  model  with  a  non- 
symmetric  half-plane  (NSHP)  region  of  support  for  two- 
dimensional  continuous- valued  vector  fields.  It  has  the  sim¬ 
plicity,  efficiency,  and  ease  of  use  of  the  well-known  hid¬ 
den  Markov  model  (HMM)  and  associated  Baum- Welch  al¬ 
gorithms  for  time- series  and  other  one-dimensional  prob¬ 
lems.  At  the  same  time,  it  is  able  to  learn  textures  on  a  two- 
dimensional  field.  We  describe  a  fast  approximate  forward 
procedure  for  computation  of  the  joint  probability  density 
function  (PDF)  of  the  vector  field  as  well  as  an  approximate 
Baum- Welch  algorithm  for  parameter  re-estimation.  We  test 
the  method  using  synthetic  textures. 

1.  INTRODUCTION 

1.1  Goal 

Let  Sij  be  a  discrete  random  variable  taking  values  between 
1  and  M  on  a  two-dimensional  field  1  <  i  <  N\,  1  <  j  < 
N2.  Assume  that  we  cannot  observe  Sij  directly  but  instead 
observe  a  D-dimensional  continuous  valued  random  vector 
“pixel”  Xf  j  G  &D  whose  probability  density  function  (PDF) 
depends  on  the  state  at  pixel  /,  j  and  is  denoted  by  p(x.ij\sij). 
We  would  like  to  (1)  model  the  statistical  behavior  of  the 
discrete  random  field  (2)  estimate  the  PDFs  /?(x|,s)  and  (3) 
determine  the  joint  PDF  of  the  entire  field  X  =  {xij  :  1  < 
i  <N 1 ,  1  <  j  <  N2  given  the  model  parameters. 

1.2  Previous  Work 

Real-world  patterns  and  textures  exhibit  complex  statistical 
dependencies  between  neighboring  pixels.  In  order  to  model 
these  dependencies  in  a  tractable  way,  we  often  assert  the 
Markov  property.  This  property  holds  that  the  state  of  the 
system  at  a  given  pixel  conditioned  on  the  entire  field  or  part 
of  the  field  can  be  expressed  in  terms  of  only  the  states  of  the 
of  neighboring  pixels.  The  choice  of  the  neighborhood  has 
a  profound  affect  on  the  properties  of  the  model  [1]  [2],  [6]. 
We  can  classify  the  models  into  two  subclasses:  causal  and 
non-causal. 

In  non-causal  models,  the  conditional  statistical  depen¬ 
dence  extends  in  all  directions  and  the  neghborhood  of  a  pixel 
surrounds  the  pixel  itself.  Although  this  is  probably  a  better 
representation  of  real-world  processes,  the  statistical  analysis 
of  such  fields  requires  the  mathematics  of  Gibbs  fields  and  its 
associated  limitations.  Limitations  include  (1)  the  necessity 
of  using  iterative  methods  to  generate  synthetic  fields  from 
the  model,  (2)  the  inability  to  find  a  closed-form  expression 


for  the  constant  of  proportionality  of  the  probability  distribu¬ 
tions  -  a  necessity  for  classification,  and  (3)  reliance  on  en¬ 
ergy  “clique”  functions  which  are  non-intuitive  and  indirect 
representations  of  the  statistical  dependencies. 

In  causal  models,  the  neighborhood  system  is  one-sided 
with  statistical  dependence  expressed  only  in  terms  of  “past” 
data.  This  allows  computation  of  the  joint  probability  density 
function  (PDF)  of  the  entire  field  of  vectors  in  a  recursive 
manner.  Additional  advantages  are  the  reliance  on  direct  in¬ 
tuitive  conditional  dependencies  and  the  ability  to  synthesize 
random  fields  in  one  pass.  Among  causal  models  the  Markov 
Mesh  is  a  very  tractable  model  that  expresses  dependencies 
in  terms  of  just  two  neighboring  pixels  [4],  [5].  The  simplic¬ 
ity  of  the  Markov  mesh  models,  however,  may  limit  the  use¬ 
fulness  of  the  model  [1],  [6].  The  non-symmetric  half-plane 
(NSHP)  region  of  support  (ROS)  consists  of  four  neighbor¬ 
ing  pixels  and  provides  the  most  general  causal  model  [6] . 

The  discussion  up  to  now  has  assumed  that  Sij  are  di¬ 
rectly  observable.  When  statistically  modeling  X,  another 
layer  of  complexity  is  added  especially  when  trying  to  es¬ 
timate  model  parameters.  In  the  case  of  one-dimensional 
problems  such  as  in  speech  recognition,  the  parameter  es¬ 
timation  problem  is  handled  very  efficiently  using  the  Baum- 
Welch  algorithm  [3].  Unfortunately,  the  forward  procedure 
and  backward  procedure ,  components  of  the  Baum- Welch  al¬ 
gorithm  which  recursively  and  efficiently  compute  the  joint 
PDF  of  the  field  of  noisy  measurments,  do  not  generalize  to 
the  NSHP  two-dimensional  model. 

In  this  paper,  we  demonstrate  approximate  equivalents  to 
the  forward  procedure  and  backward  procedure.  The  result 
is  an  approximate  Baum- Welch  algorithm  for  estimating  the 
parameters  of  the  NSHP  Markov  model  as  well  as  the  obser¬ 
vation  PDFs. 


1.3  Non-symmetric  half-plane  (NSHP)  Model. 

The  NSHP  neighborhood  system  has  been  used  for  two- 
dimensional  autoregressive  analysis  and  image  and  texture 
analysis  [7], [8],  [6].  The  NSHP  model  is  based  on  ordering 
the  pixels  in  an  image  by  scanning  in  a  raster  pattern  (see  fig¬ 
ure  1).  As  we  scan  the  plane  from  left  to  right,  and  bottom  to 
top,  when  we  reach  pixel  “A”,  we  have  already  visited  “B”, 
“C”,  “D”,  and  “E”.  Because  each  of  the  pixels  in  the  ROS  are 
in  the  “past”,  we  can  construct  a  recursive  forward  procedure 
that  computes  state  probabilities  based  on  “past”  pixels. 
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Figure  1 :  Illustration  of  the  neighbohood  system  of  the  non- 
symmetric  half  plane  model  (NSHP).  Pixel  A  is  represented 
statistically  in  terms  of  neighbors  B,C,D,  and  E. 


2.  MATHEMATICAL  ALGORITHM  DESCRIPTION 

2.1  Notation  and  mathematical  definition  of  the  NSHP 
model. 

Let  x/j  G  &D  be  the  dimension  D  data  (feature  vector)  at 
pixel  ij  where  1  <i<N\ ,  1  <  j  <  A^.  We  define  the  “past 
data”  X</j>  as  all  data  occurring  “before”  x;j-  that  is  either 
to  the  left  or  below  and  not  including  x?-j.  More  precisely, 

<  hj  >=  I*',/}  :  (j'  <  i  AND  /  =  j)  OR  (/  <  j). 

We  also  define  the  “past  and  current  data”  X*  j  as  the  union 
of  X</j>  and  x?  J-.  We  define  p?  j(m|X<i  j>)  as  the  a  priori 
probability  of  state  m  at  pixel  /,  7  given  all  data  up  to  but  not 
including  x/j.  Similarly,  we  define  p;  j  (m|X;  j  )  as  the  a  pos¬ 
teriori  probability  of  state  m  at  pixel  /,  j  given  all  data  up  to 
and  including  x*j.  Let  the  index  variables  r,sf,u  index  over 
the  states  of  the  neighbor  pixels  (“B”,  “C”,  “D”,  and  “E”, 
respectively  in  figure  1).  Let  M  be  the  number  of  possible 
states. 

2.2  NSHP  forward  procedure. 

The  forward  procedure  in  HMM  terminology  [3]  recursively 
computes  the  state  probabilities  at  each  pixel.  It  also  com¬ 
putes  the  joint  probability  of  all  the  data  given  the  model, 
which  is  needed  in  a  classifier.  We  can  recursively  compute 
the  state  probabilities  as  follows.  In  the  first  step  of  the  for¬ 
ward  procedure  we  compute  the  a  priori  probability 

M  M  M  M 

Pij  (m  I  X<j J> )  =  XI  L  L  L  Pi>j  (m>  r>  s> 1  > u  I  X<U> ) 

r=  1  s=\t=\  u=  1 
M  M  M  M 

r=l  s=lt=l  u=  1 

Pip- 1  mj-l)  Pi+lJ—l  (l|X/+u_i) 

’Pi—  IJ (w|X/_  1  j)  (tw) 7 

(1) 

where  C r,s,t,u(m)  is  the  M4  x  M  state  transition  matrix.  The 
expression  that  replaces  pij(m,r,s,t,u\'X.<ij>)  uses  only 
quantities  that  have  been  previously  computed.  It  is  approx¬ 
imate  because  (a)  the  data  conditions  used  in  the  terms  on 


the  right  hand  side  (RHS)  are  not  equivalent  to  the  data  con¬ 
dition  X</j>  used  on  the  left  hand  side  (LHS)  and  (b)  the 
joint  probability  pij(m,r,s,t,u |X<;j>)  is  approximated  by 
the  product  of  the  marginals.  Although  the  RHS  expression 
is  approximate,  it  (a)  depends  only  upon  data  that  is  strictly 
contained  in  the  condition  X</j>  and  (b)  is  a  probability  on 
(m,  r,  s,  t ,  u) .  Thus,  the  result  is  still  a  valid  PDL  and  produces 
in  the  end  a  valid  PDL  on  X.  This  is  an  important  observation 
because  the  maximization  of  the  resulting  approximate  data 
field  PDL  over  the  model  parameters  would  not  be  a  valid 
approach  if  this  were  not  true. 

The  a  posteriori  probability  is  denoted  by  odtf  and  is 
computed  by  updating  (1)  using  the  data  at  pixel  /,  j 


am  ~  Pij(m\Xij) 


p,-j(m|X</j>)p(x,-j|m) 


M 


E 


Pi,}  (n  I  X<;  j>  )/>(xj  j  |n) 


(2) 


where  p(x|m),  m  =  1 . . .  Af,  are  the  state  PDLs  represented 
as  Gaussian  mixtures  [3]. 

The  joint  PDL  of  the  data  up  to  and  including  x/.;  can  be 
recursively  computed  in  parallel: 

=  p(Xi-u)  p(xij\Xi-u) 

—  pfri-ij)  p(xij\X<ij>) 

M 

=  p(Xi-u)  £ 

m=  1 

(3) 

When  the  algorithm  reaches  the  last  pixel  (i  =  Afi,  j  =  N2), 
the  joint  PDL  of  the  entire  field  is  given  by 


L(X)  = 


2. 2. 7  Initialization 

The  forward  procedure  (1),  (2),  (3)  assumes  that  the  altf  at 
all  four  neigbors  have  already  been  calculated.  To  initialize 
the  algorithm,  we  begin  with  the  first  pixel  (i  =  1,  j  =  1).  Lor 
this,  we  need  the  a  priori  state  probabilities  n(m ),  1  <m< 
M.  Then,  we  can  progress  through  the  first  row  of  pixels  us¬ 
ing  the  well-known  HMM  forward  procedure.  Lor  this  we 
need  the  1-D  state  transition  matrix  (STM)  Au{m)  =  Pr(st  = 
m\st-\  =  u).  When  we  reach  the  first  pixel  of  the  second  row, 
we  need  a  reduced  version  of  C r,s,t,u(m)  since  there  is  no 
pixel  immediately  to  the  left  (pixel  “E”  in  figure  1).  There  is 
also  no  pixel  underneath  and  to  the  left  pixel  “B”  in  figure  1), 
however  if  we  choose,  we  can  wrap  data  around  to  that  the 
last  pixel  on  the  first  row  takes  the  place  of  pixel  “B”.  Us¬ 
ing  this  approach  of  wrapping  around  pixel  “B”,  the  reduced 
STM  is  Ap  —by  —  M  and  is  denoted  by  Br^(ra).  The  ini¬ 
tialization  is  a  straight-forward  simplification  of  (1),  (2),  (3) 
made  by  removing  the  index  u  and  associated  factors. 

2.2.2  Synthesis  of  Random  fields. 

Just  as  the  forward  procedure  is  analogous  to  the  forward 
procedure  of  the  HMM,  the  synthesis  of  discrete  random 
fields  is  directly  analogous  to  the  Markov  chain.  To  syn¬ 
thesize  a  random  field  from  the  model,  we  first  synthesize  a 
discrete  random  field,  then  as  a  second  step,  synthesize  fea¬ 
ture  values  from  the  state  PDLs  given  the  chosen  states  at 
each  pixel.  State  synthesis  is  an  algorithm  directly  parallel 
to  the  forward  procedure:  we  use  the  STM  to  determine  the 


state  probabilities  of  the  M  states  at  a  given  pixel  given  the 
already-chosen  states  of  the  “past”  neighbors,  select  the  dis¬ 
crete  random  variable,  then  continue  to  the  next  pixel. 

2.3  NSHP  backward  procedure. 

Only  the  forward  procedure  is  needed  to  compute  total  field 
probability  given  the  model.  The  time-reversed  version, 
called  the  backward  procedure  [3],  is  needed  as  a  component 
of  the  Baum  Welch  algorithm  to  re-estimate  the  model  param¬ 
eters  [3].  Unfortunately,  it  is  impractical  to  create  a  NSHP 
version  of  the  backward  procedure  that  follows  the  spirit  of 
the  one-dimensional  version  because  the  NSHP  neighbor¬ 
hood  system  is  not  a  simply  ordered  Markov  model.  The 
state  transition  matrix,  C r,s,t,u(m)  in  eq.  1),  is  of  dimen¬ 
sion  M4-by-M.  To  implement  the  backward  procedure,  we 
would  require  the  backward  version  of  C r,sj,u(m),  denoted 
by  C rStu{m),  which  must  also  be  of  the  same  dimensions. 
In  the  one-dimensional  backward  procedure,  it  is  simply 
the  transpose  of  the  forward  STM.  The  backward  procedure 
uses  backward  versions  of  the  reduced  STMs  for  initializa¬ 
tion  analogous  to  the  forward  procedure  (denoted  by  7lb(m ), 
Abu{m),  B brst{m)).  For  the  NSHP  backward  procedure,  we 
have  two  choices,  resulting  in  two  different  algorithms: 

1.  NSHP-Symmetric  algorithm:  Assume  that  the  problem 
is  180-degree  symmetric  and  use  Cbs  t  u  (m)  =  C r,s,t,u(m)- 
This  can  be  used  for  textures  that  are  symmetric  (in  the 
sense  of  rotating  them  by  180  degrees). 

2.  NSHP-Non-symmetric  algorithm:  Estimate  Cbstu  (m) 
separately.  This  adds  to  the  number  of  parameters 
that  must  be  estimated  but  allows  applicability  to  non- 
symmetric  textures. 

Let  Y  ij  and  Y  <;j>  represent  future  data  with  respect  to  x;j, 
inclusive  and  exclusive  of  x*  j,  respectively.  The  NSHP  back¬ 
ward  procedure,  analogous  to  (2)  is  given  by 

Pm  =  Pij(w|Y<(-j>) 

M  M  M  M 

-  ILL!  Pi+ 1  j'+ 1  (r  |  Y  ,■+ 1 J+ 1 ) 

r=  \  s=  \  t=  \  u=  \  (4) 

■PiJ+ 1  (4 YiJ+ 1 )  Pi- 1  J+l  (t  \ Y i- 1  ,j+ 1 ) 

■pi+lj(u\Yi+lJ)  Cbrstu{m). 

To  continue  the  recursion,  we  also  need  the  data-updated 
probabilities  of  J 8J ,  denoted  by  8J: 

8m  =  PijHYij)  =  Pm  P(xi,j\m)- 

Equation  (4)  can  then  be  rewritten 

M  M  M  M 

Pm  =  E  EE  E  #+ 1:7+1  8‘'J+]  C|J+I 

r=  1  s=  1 1=  1  u=  1  (5) 

«+U  C bsAu(m)- 

Initialization  is  analogous  to  the  forward  procedure. 


2.4  NSHP  Baum- Welch  algorithm. 

We  now  describe  an  algorithm  that  parallels  the  well-known 
Baum- Welch  algorithm  for  the  HMM  [3].  The  state  probabil¬ 
ities  given  all  the  data  are  denoted  by  yin  HMM  terminology 
[3].  Let 

YmJ  =  Pi  j(m\X), 

where  X  is  all  the  data.  We  have 


yj  ~ 

im  — 


a, 


ia  ohj 


E<  P‘kJ 


Note  that  denominator  is  just  the  numerator  summed  over  m, 
so  computing  ytf  can  be  done  in  two  steps:  (a)  multiplying 
alm  and  j 3m  together,  then  (b)  normalizing  so  that  YJ  sums 
to  1  over  m. 

We  can  use  y  to  detect  likely  instances  of  certain  states 
in  the  data.  We  can  also  use  it  to  re-estimate  state  transi¬ 
tion  probabilities  C r,s,t,u(m)  as  well  as  the  state  likelihood 
functions  p(xy  |m),  which  are  represented  by  Gaussian  mix¬ 
tures.  We  can  approximate  the  Baum  Welch  algorithm  to  re- 
estimate  C r,s,t,u(m)  using 


Li  Lj  Ymj  i  '  YsJ  ' 
■Y  lJ  lYu  lJ- 


(6) 


followed  by  a  normalization  operation  to  insure  that 

Y<Cr,s,t,u(m)  =  L 

m 


The  same  formula,  applied  on  the  180-degree  rotated  data 
field  can  be  used  to  estimate  Cbstu(m)  if  for  the  non- 
symmetric  option.  Re-estimation  of  state  PDFs,  which  can 
be  represented  by  Gaussian  mixtures,  follows  the  1-D  ap¬ 
proach  [3]. 


3.  EXPERIMENTAL  RESULTS 

3.1  Synthetic  Data 

We  created  a  1 80-degree  symmetric  synthetic  data  class  from 
a  notional  blue  ellipse  filled  with  green  on  a  red  background. 
Examples  of  the  data  are  shown  in  figure  2.  The  data  are 
three-dimensional  (RGB)  feature  vector  with  additive  Gaus¬ 
sian  noise.  The  ellipse  is  randomly  positioned  in  the  plane. 
We  also  created  a  non- symmetric  synthetic  texture  data  class 
from  repeating  triangles  in  a  green  field  (figure  3). 

3.2  NSHP-Symmetric  Model 

We  trained  the  NSHP-symmetric  model  with  M  =  3  discrete 
states  on  30  samples  of  the  symmetric  texture  from  figure 
2.  The  state  PDFs  were  represented  by  a  single  Gaussian 
(mixture  with  1  component)  and  was  initialized  randomly  in 
thirty-two  random  trials.  The  algorithm  was  halted  if  the  log- 
likelihood  began  to  decrease.  The  best  random  trial  was  used. 
A  representation  of  the  y  probabilities  for  one  data  sample  is 
shown  in  figure  4.  This  can  be  compared  with  the  data  in 
figure  2.  The  NSHP  model  easily  generates  random  fields 
as  shown  by  the  example  in  figure  5.  The  random  fields  can 
be  compared  with  figure  4.  Clearly  the  NSHP  model  has  suc¬ 
ceeded  in  learning  the  synthetic  pattern.  When  we  trained  the 


Figure  2:  One  example  of  the  symmetric  “ellipse”  synthetic 
data  field.  Rotating  the  image  by  180  degrees  produces  an 
equally  valid  sample.  Data  is  a  three-dimensional  feature 
vector  rendered  as  an  RGB  image. 


Figure  4:  Gamma  probabilities  for  the  example  in  figure  2. 
The  image  is  coded  with  a  color  representing  the  most  likely 
state  as  determined  by  ytt .  Red:  state  1,  green:  state  2,  blue: 
state  3. 
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Figure  3:  One  example  of  the  non- symmetric  synthetic  data 
field.  Rotating  the  image  by  180  degrees  does  not  produce 
a  valid  sample.  Data  is  a  three-dimensional  feature  vector 
rendered  as  an  RGB  image. 


symmetric  model  on  the  non- symmetric  data  and  synthesized 
random  fields,  we  obtained  figure  6.  As  we  would  expect,  we 
see  character  in  the  synthetic  pattern  of  triangles,  but  there  is 
no  apparent  up  or  down. 

3.3  NSHP  Non-Symmetric  Model 

When  we  trained  the  non- symmetric  model  on  the  non- 
symmetric  texture  in  figure  3  and  synthesized  random  fields, 
we  obtained  figure  7.  Here,  we  see  clearly  the  upward- 
pointing  triangles. 


Figure  5:  Synthetic  random  field  generated  by  the  NSHP 
symmetric  model  trained  on  the  data  in  figure  2. 


random  field.  Since  each  feature  vector  from  the  origi¬ 
nal  field  is  replicated  5  times,  the  new  higher-dimensional 
field  is  redundant.  Nevertheless,  the  forward  procedure  just 
described  provided  a  PDF  value  for  the  field  of  extended 
vectors.  When  we  synthesize  random  fields  from  the  non- 
symmetric  model  on  extended  feature  vectors,  and  obtained 
figure  8,  there  is  the  appearance  of  better  texture  synthesis 
than  figure  7. 

3.5  Conditional-Extended  feature  (CEF)  vector  model. 

As  we  explained,  the  forward  procedure  operating  on  the  ex¬ 
tended  feature  vectors  produces  an  exact  PDF  of  the  field  of 


3.4  Extended  feature  (EF)  vector  model. 

In  hidden  Markov  models,  to  which  the  algorithm  we  have 
described  belongs,  the  statistical  relationship  between  a  pixel 
and  its  neighbors  is  based  on  discrete  states  only.  If  the  num¬ 
ber  of  states  is  small,  we  do  not  have  the  benefit  of  accu¬ 
rately  predicting  the  color  of  a  given  pixel  from  the  color  of 
its  neighbors  except  through  their  discrete  states.  This  can 
limit  the  usefulness  of  the  models.  From  a  strict  application 
of  Bayes  theorem  to  develop  a  joint  PDF  of  the  noisy  vector 
field,  there  is  no  reason  we  cannot  also  use  the  feature  val¬ 
ues  in  the  neighboring  pixels  to  assist  in  the  prediction.  To 
this  end,  we  create  at  each  pixel  an  extended  feature  vector 
x.et  j  E  &5D  which  is  of  dimension  5 D  because  it  includes  the 
feature  vectors  of  the  current  pixel  plus  the  four  neighbors . 

It  must  be  kept  in  mind  that  by  using  extended  feature 
vectors,  we  are  in  effect  creating  a  new,  higher-dimension 


Figure  6:  Synthetic  random  field  generated  by  symmetric 
NSHP  model  trained  on  the  data  in  figure  3.  Left:  states, 
Right:  color  image.  There  is  no  apparent  up  or  down  evident 
in  the  synthetic  field. 


Figure  7:  Synthetic  random  field  generated  by  non- 
symmetric  NSHP  model  trained  on  the  data  in  figure  3.  Left: 
states,  Right:  color  image.  In  contrast  to  figure  6,  there  is  a 
clear  notion  of  up  and  down  evident  in  the  synthetic  field. 


Figure  9:  Synthetic  random  field  generated  by  non- 
symmetric  NSHP  model  with  conditional  extended  feature 
(CEF)  vectors  trained  on  the  data  in  figure  3.  Left:  states, 
Right:  color  image. 


1 0  20  30  40  50 


Welch  algorithm  for  the  one-dimensional  HMM.  We  have 
also  described  variations  that  utilize  extended  feature  vectors 
to  permit  effective  feature  prediction  that  would  not  other¬ 
wise  be  possible  with  doubly- stochastic  models.  We  have 
also  demonstrated  the  algorithm’s  ability  to  learn  textures. 
Based  on  texture  synthesis,  the  extended  feature  vector  (EF) 
model  appears  to  be  superior.  We  found  this  also  to  be  the 
case  in  texture  classification  experiments  which  space  limi¬ 
tations  do  not  allow  us  to  describe. 


Figure  8:  Synthetic  random  field  generated  by  non- 
symmetric  NSHP  model  with  extended  feature  (EF)  vectors 
trained  on  the  data  in  figure  3.  Left:  states,  Right:  color  im¬ 
age. 


extended  feature  vectors.  Exact  is  meant  in  the  sense  that  for 
a  fixed  set  of  parameters,  it  integrates  to  exactly  1 .  This  is 
useful  in  classification  of  textures  provided  all  texture  mod¬ 
els  being  compared  use  the  same  EF  model.  However,  if  we 
need  the  PDF  of  the  original  field,  we  must  use  the  condi¬ 
tional  PDFs.  Let 


where  x* j  is  the  feature  vector  at  pixel  /,  j  and  w ij  is  the 
4D-by-l  vector  of  features  from  the  four  NSHP  neighbors. 
In  the  forward  procedure,  we  use  in  place  of  the  state  PDFs 
the  conditional  PDF  p(x.ij\m, w/j).  The  necessary  condi¬ 
tional  PDFs  can  be  generated  easily  from  Gaussian  mixtures 
in  closed  form,  even  with  more  than  one  mixture  component. 
We  synthesize  random  fields  from  the  CEF  model  and  ob¬ 
tained  figure  9.  We  see  a  degradation  wih  respect  to  figure  8. 


4.  CONCLUSIONS 

We  have  presented  a  statistical  model  for  two-dimensional 
fields  of  pixels  where  each  pixel  is  represented  by  a 
continuous-valued  feature  of  arbitrary  dimension.  The  model 
is  directly  analogous  to  the  one-dimensional  hidden  Markov 
model  (HMM).  We  have  described  an  fast  approximate  for¬ 
ward  procedure  that  recursively  computes  the  joint  PDF  of 
the  entire  data  field  as  well  as  an  algorithm  for  re-estimation 
of  model  parameters  modeled  after  the  well-known  Baum- 
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